Abstract - In this paper, the detrended fluctuation analysis 
(DFA) method is used to characterize the self-similarity 
of DNA sequences. Using this method, four complete gene 
sequences related to the pathogenesis of Alzheimer Disease, 
obtained from NCBI, are analyzed. The correlation properties 
of the nucleotide density distribution in these DNA sequences 
are explored. When the sample DFA exponent is calculated 
under seven different mapping rules for the four sequences, 
the DFA exponent α is found to be larger than 1.5. The main 
outcome of this study is evidence of correlation in the 
sequences related to the pathogenesis of Alzheimer Disease. 
This result points to the need for a new direction in analyzing 
and understanding the intrinsic structures of DNA sequences. 

Index Terms — DFA; Correlation; pathogenesis of Alzheimer 
Disease. 

I. Introduction 

Alzheimer Disease (AD) is the most common form of 
dementia and the most frequent degenerative brain disorder 
that humans encounter in old age. The risk of developing 
AD increases substantially after the age of 65 [1]. AD is 
rapidly becoming one of the major universal healthcare 
problems. While the cause of this disease remains unknown, 
there is evidence for substantial genetic influence [1]-[5]. 
Because the pathogenesis is unclear, several hypotheses 
about the pathogenesis of AD have been proposed, such as 
the abnormal protein hypothesis, the cholinergic hypothesis, 
the oxidative stress theory and the estrogen hypothesis. 
Mutations in three genes, amyloid precursor protein (APP) 
and presenilin 1 and 2 (PSEN1, PSEN2), result in an 
early-onset, autosomal dominant form of the disease 
beginning in the third or fourth decade. The ε4 allele of 
apolipoprotein E (APOE) increases the risk of both sporadic 
and familial AD occurring later in life, around the sixth 
decade. Each of these genes is involved in the production or 
processing of the amyloid β peptide, which is deposited in 
the brain as dense plaques that are characteristic of the 
disease [3]. At present, however, there are neither precise 
diagnostic approaches nor effective therapeutic agents 
available for Alzheimer Disease. 
In the last decade or so there has been a groundswell of 
interest in unraveling the mysteries of DNA. Many 
approaches have been proposed to distinguish coding regions 
from non-coding ones. The correlation properties of 
nucleotides in DNA sequences have been investigated as 
well [6]-[15]. Using the one-dimensional DNA walk model, 
C.K. Peng et al. found that long-range correlation exists in 
non-coding regions but not in coding regions. At the same 
time, nonlinear scaling methods such as complexity 
(Jun Xu et al., 1994) and fractal analysis [13]-[17] 
were used. 

Detrended fluctuation analysis is a scaling analysis 
method that provides a simple quantitative parameter, the 
scaling exponent α, to represent the correlation properties of 
a signal. The advantage of DFA over many other methods is 
that it permits the detection of long-range correlations 
embedded in seemingly nonstationary time series, and also 
avoids the spurious detection of apparent long-range 
correlations that are an artifact of nonstationarity. The DFA 
method has been applied in many fields, such as cardiac 
dynamics, bioinformatics, economics, meteorology, material 
science and biology [17]-[23]. The correct interpretation of 
the scaling results obtained by the DFA method is crucial for 
understanding the intrinsic kinetics of the systems under 
study. In fact, for all systems where the DFA method has 
been applied, many problems remain unexplained. Two 
common challenges are that the correlation exponent is not 
always constant and that crossovers often exist. The main 
purpose of this paper is to analyze the scaling behavior of 
the fluctuations in DNA sequences related to the 
pathogenesis of Alzheimer Disease by means of detrended 
fluctuation analysis, and to reveal the existence of 
correlation features in them. Moreover, we propose a global 
measure of the DNA walk that has potential utility in 
analyzing these correlation features. Although we do not 
discuss the possible mechanism of the pathogenesis of 
Alzheimer Disease, the results suggest the existence of 
correlation in the sequences involved. The research is 
therefore of technical significance. 

This paper is divided into four parts. The first part 
introduces the pathogenesis of Alzheimer Disease and the 
background of DNA sequence analysis; the second part 
briefly reviews the materials and the detrended fluctuation 
analysis method; the third and fourth parts present the 
results and the discussion of our research. 
II. Materials and Methods 


A. Data Resources 

We searched the GenBank DNA sequence database 
(http://www.ncbi.nlm.nih.gov) and used the following 
sequences: Homo sapiens amyloid beta (A4) precursor 
protein (APP), RefSeqGene on chromosome 21 (NCBI 
Reference Sequence: NG_007376.1, GI:166795291); Homo 
sapiens apolipoprotein E (APOE), RefSeqGene on 
chromosome 19 [24]; Mus musculus presenilin-1 gene, 
alternatively spliced transcripts, complete cds (GenBank: 
AF007560.1, GI:2463667); and Mus musculus presenilin-2 
gene, strain 129X1/SvJ chromosome 12 
CRA_211000022007779, whole genome shotgun sequence 
(GenBank: AAHY01101600.1, GI:69874353). 

B. Mapping rules 

A DNA sequence of length L, with positions i = 1, 2, ..., L, 
is composed of four types of bases: adenine (A), thymine (T), 
guanine (G) and cytosine (C). In order to apply numerical 
methods to a nucleotide sequence, we first prepare seven 
numerical sequences {u_i}, corresponding to seven ways of 
mapping the original nucleotide sequence onto a 
one-dimensional numerical sequence. The mapping rules 
[25], [26] used to convert DNA sequences into binary 
numerical sequences are displayed in Table I. 


Table I 

Rule   Assignment 
RY     A or G = 1;   C or T = -1 
A      A = 1;        T, C or G = -1 
G      G = 1;        A, T or C = -1 
T      T = 1;        A, C or G = -1 
C      C = 1;        A, T or G = -1 
SW     C or G = 1;   A or T = -1 
KM     A or C = 1;   G or T = -1 


The RY rule has been widely used, but the other rules 
have also been applied. 
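
The seven rules of Table I can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names RULES and map_sequence are hypothetical, the toy sequence is not taken from the data used in this paper, and symbols other than A, C, G and T are simply rejected.

```python
# Hypothetical helper names; the assignments follow Table I.
# Each rule lists the bases mapped to +1; every other base maps to -1.
RULES = {
    "RY": "AG",  # purines (+1) versus pyrimidines (-1)
    "A": "A",
    "G": "G",
    "T": "T",
    "C": "C",
    "SW": "CG",  # strong (+1) versus weak (-1) base pairing
    "KM": "AC",  # amino (+1) versus keto (-1)
}


def map_sequence(seq, rule):
    """Return a list of +1/-1 values for a DNA string under one mapping rule."""
    plus = RULES[rule]
    values = []
    for base in seq.upper():
        if base not in "ACGT":
            raise ValueError(f"unexpected symbol {base!r}")
        values.append(1 if base in plus else -1)
    return values


example = "ACGTTGCA"  # toy sequence, not taken from the paper's data
for rule in RULES:
    print(rule, map_sequence(example, rule))
```

Storing only the +1 bases for each rule keeps the table compact, since the -1 set is always the complement.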


C. Detrended fluctuation analysis 

The method of detrended fluctuation analysis has proven 
useful in revealing the extent of long-range correlations in 
time series. It provides a simple quantitative parameter, the 
scaling exponent α, which is a signature of the correlation 
properties of the signal. The DFA method begins by 
dividing the integrated time series y(k) of length N into N/n 
nonoverlapping boxes (also called windows). In each box of 
length n, a least-squares line is fit to the data (representing 
the trend in that box). The y coordinate of the straight-line 
segments is denoted by y_n(k). We detrend the integrated 
time series y(k) by subtracting the local trend y_n(k) in each 
box. The root-mean-square fluctuation of this integrated and 
detrended time series is calculated by Eq. (1). 

$$F(n) = \sqrt{\frac{1}{N} \sum_{k=1}^{N} \left[ y(k) - y_n(k) \right]^2} \qquad (1)$$

This computation is repeated over all time scales (box 
sizes) to characterize the relationship between the average 
fluctuation F(n) and the box size n, namely F(n) ~ n^α. 
Typically, F(n) will increase with box size n. A linear 
relationship on a log-log plot indicates the presence of 
power-law (fractal) scaling. Under such conditions, the 
fluctuations can be characterized by a scaling exponent α, 
the slope of the line relating log F(n) to log n [19]. A value 
of α = 0.5 indicates that the changes in the values of a time 
series are random, namely uncorrelated with each other. If 
0.5 < α < 1, the signal is persistent (positively correlated), 
while if 0 < α < 0.5, the signal is antipersistent. If α > 1, 
correlations exist but cease to be of a power-law form; 
α = 1.5 indicates brown noise. 
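
The procedure described in this subsection can be sketched with NumPy as follows. This is a minimal sketch, not the implementation used for the results in this paper: the function names, the choice of box sizes and the test signal are assumptions, and samples that do not fill a whole box are simply discarded.

```python
import numpy as np


def dfa_fluctuation(x, n):
    """Root-mean-square fluctuation F(n) of the integrated, detrended series."""
    x = np.asarray(x, dtype=float)
    y = np.cumsum(x - x.mean())        # integrated series y(k)
    n_boxes = len(y) // n              # nonoverlapping boxes of length n
    boxes = y[: n_boxes * n].reshape(n_boxes, n)  # drop the incomplete tail
    k = np.arange(n)
    sq_resid = 0.0
    for box in boxes:
        # Least-squares line in this box: the local trend y_n(k)
        slope, intercept = np.polyfit(k, box, 1)
        sq_resid += np.sum((box - (slope * k + intercept)) ** 2)
    return np.sqrt(sq_resid / (n_boxes * n))


def dfa_exponent(x, box_sizes):
    """Slope of log F(n) versus log n, i.e. the scaling exponent alpha."""
    f = [dfa_fluctuation(x, n) for n in box_sizes]
    alpha, _ = np.polyfit(np.log(box_sizes), np.log(f), 1)
    return alpha


# Assumed test signal: Gaussian white noise, for which alpha is close to 0.5
rng = np.random.default_rng(0)
white = rng.standard_normal(4096)
sizes = np.array([8, 16, 32, 64, 128, 256])  # assumed box sizes
print(dfa_exponent(white, sizes))
```

Subtracting the mean before integration is a common convention; because a straight line is removed in every box, it does not change F(n).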

III. Results 

A useful way of analyzing the patchiness that arises from 
the heterogeneous purine-pyrimidine content is the DNA 
walk, defined using the mapping rules in Table I above. The 
displacement of the walker after n steps, y(n), is defined as 

$$y(n) = \sum_{i=1}^{n} u_i$$

and is displayed on a graph of y(n) versus n. 
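
As a small worked example of the walker displacement, the sketch below computes y(n) as a running sum of the +1/-1 steps. The function name dna_walk and the toy sequence are assumptions made for illustration; the default argument corresponds to the RY rule.

```python
import numpy as np


def dna_walk(seq, plus_bases="AG"):
    """Displacement y(n): running sum of steps u_i, each +1 or -1."""
    u = np.array([1 if base in plus_bases else -1 for base in seq.upper()])
    return np.cumsum(u)


# Toy sequence (not from the paper's data): steps are +1 +1 +1 -1 -1 -1 +1
print(dna_walk("AGGCTTA"))  # [1 2 3 2 1 0 1]
```

A purine-rich patch appears as an upward run of the walk and a pyrimidine-rich patch as a downward run.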

A. The characteristic of DNA walk 
We find apparent patchiness in real DNA sequences, 
both in the noncoding and coding regions. Figure 1(a-g) 
shows representative DNA walks for the different-rule 
mappings of the amyloid beta precursor protein (APP) 
sequence. 






Fig. 1(a-g). Representative DNA walks for the 
different-rule mappings of amyloid beta precursor protein 
(APP) sequences. 

B. The characteristic of brown noise and white noise 
We use a simulation method to produce two series, 
namely brown noise and white noise. The displacement of 
the walker after n steps, y(n), is used; Fig. 2 shows the 
brown noise and white noise walkers. 

Fig. 2. Brown noise and white noise walkers. 
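
One way to generate the two reference series is sketched below. The paper does not state how the noise was simulated, so Gaussian increments, the series length and the seed are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)  # fixed seed so the run is repeatable
n_steps = 10_000                 # assumed length; the paper does not state one

white = rng.standard_normal(n_steps)  # uncorrelated increments
brown = np.cumsum(white)              # brown noise: running sum of white noise

# Walker displacement y(n) for each series, defined as for the DNA walk
walk_white = np.cumsum(white)
walk_brown = np.cumsum(brown)

print(walk_white[:5])
print(walk_brown[:5])
```

Under these assumptions the walker of the white noise series is itself a brown noise series, which is why the two walkers look so different in smoothness.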


C. Detrended fluctuation analysis results 
Double-logarithmic plots of the detrended fluctuation 
analysis of the DNA walk for the different-rule mappings of 
the amyloid beta precursor protein (APP) sequence are 
shown in Fig. 3. The exponent was obtained from the slope 
of the line fitted through the data. Values of α calculated for 
the four sequences, amyloid beta (A4) precursor protein 
(APP), apolipoprotein E, presenilin-1 gene and presenilin-2 
gene, under the different rules are shown in Tables II-V. 
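
For reference, the slope of the fitted line can be written out explicitly. The expression below is the ordinary least-squares slope on the double-logarithmic plot; the symbols x_j, z_j and c are introduced here only for illustration, and the paper does not state which fitting procedure or range of box sizes was used.

```latex
\log F(n_j) \approx \alpha \log n_j + c,
\qquad
\hat{\alpha} = \frac{\sum_j (x_j - \bar{x})(z_j - \bar{z})}{\sum_j (x_j - \bar{x})^2},
\qquad
x_j = \log n_j,\quad z_j = \log F(n_j).
```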





Fig. 3. Double-logarithmic plots of the detrended 
fluctuation analysis of the DNA walk for the different-rule 
mappings of amyloid beta precursor protein (APP) 
sequences. 

Table II. Amyloid beta precursor protein (APP) sequences 



The results above show that correlations exist in the 
sequences with potential relevance to the pathogenesis of 
Alzheimer Disease. For comparison, random noise and 
brown noise were simulated and the detrended fluctuation 
analysis exponent α was computed for each. The results 
show α = 0.5 for random noise and α = 1.5 for brown noise; 
these simulated series do not exhibit self-similarity. 
Double-logarithmic plots of the detrended fluctuation 
analysis of the random noise and brown noise RY-rule 
mapping sequences are shown in Fig. 4. The exponent was 
obtained from the slope of the line fitted through the data. 
The detrended fluctuation analysis exponents for the 
different-rule mappings of the sequences related to the 
pathogenesis of Alzheimer Disease are shown in Fig. 5. 



Fig. 4. Double-logarithmic plots of the detrended 
fluctuation analysis of random noise and brown noise 
RY-rule mapping sequences. 


Values of α 

Rule   Genome   α 
RY     APP      1.6969 
A      APP      1.9968 
G      APP      1.9868 
Fig. 5. Detrended fluctuation analysis exponents for the 
different-rule mappings of the sequences related to the 
pathogenesis of Alzheimer Disease. 


IV. Discussion 

We have examined the detrended fluctuation analysis 
exponent of DNA sequences in four different mutation 
genes. Detrended fluctuation analysis exponents 
corresponding to different power-law decays have been 
clearly distinguished. The result indicates the existence of 
correlation in the system. The sequence data were subjected 
to the detrended fluctuation analysis method, and the 
technique reveals the correlation properties of the DNA 
sequences. The detrended fluctuation analysis exponent is 
above 1.5. In detrended fluctuation analysis, a power-law 
relationship between F(n) and n indicates scaling with an 
exponent α, F(n) ~ n^α. Note that such a process follows a 
power law. 

From the results we find that different nucleotide 
sequence representations give different results. For the 
amyloid beta precursor protein, apolipoprotein E, 
presenilin-1 and presenilin-2 sequences, the detrended 
fluctuation analysis exponent obtained under the A rule, 
G rule, C rule and T rule is almost 2. The exponent values 
computed under the other rules are above 1.5. For 
comparison, the detrended fluctuation analysis exponent is 
0.5 for a "random noise" sequence and 1.5 for a "brown 
noise" sequence. 

In this paper the correlation properties in the complete 
sequences of amyloid beta (A4) precursor protein (APP), 
apolipoprotein E, presenilin-1 gene and presenilin-2 gene 
are verified. The results of this paper suggest that the 
asymptotic correlation property is one of the natural 
properties that DNA sequences possess, and is directly 
related to the structure and function of the whole DNA 
molecule. There are still many discussions concerning the 
biological meanings and the origins of the correlation 
properties in DNA sequences. For example, the APP gene 
encodes a cell surface receptor and transmembrane 
precursor protein that is cleaved by secretases to form a 
number of peptides. Some of these peptides are secreted and 
can bind to the acetyltransferase complex APBB1/TIP60 to 
promote transcriptional activation, while others form the 
protein basis of the amyloid plaques that are found in the 
brains of patients with Alzheimer disease. Mutations in this 
gene have been implicated in autosomal dominant 
Alzheimer disease and cerebroarterial amyloidosis. Multiple 
transcript variants encoding several different isoforms have 
been found for this gene [27]. Chylomicron remnants and 
very low density lipoprotein (VLDL) remnants are rapidly 
removed from the circulation by receptor-mediated 
endocytosis in the liver. Apolipoprotein E, a main 
apoprotein of the chylomicron, binds to a specific receptor 
on liver cells and peripheral cells. ApoE is essential for 
normal catabolism of triglyceride-rich lipoprotein 
constituents [24]. Further background is given by studies of 
the transcriptional regulation of the mouse presenilin-1 
gene [28] and by a comparison of whole-genome 
shotgun-derived mouse chromosome 16 and the human 
genome [29]. These properties are considered to be related 
to the construction of the higher-order structure of the DNA 
molecule. 

This paper is a preliminary study of the existence of 
correlations with potential relevance to the pathogenesis of 
Alzheimer Disease. Its aim is to provide the means for 
analyzing the fluctuation properties of DNA sequences. 
Although the results show that correlations exist in the 
sequences related to the pathogenesis of Alzheimer Disease, 
they raise a series of questions. What are the physical 
phenomena and possible mechanisms behind this 
correlation? How should the different types of pathogenesis 
of Alzheimer Disease be categorized? How should the 
function of the pathogenesis of Alzheimer Disease be 
understood? How can a kinetics model of the pathogenesis 
of Alzheimer Disease be built systematically? Although the 
data tell us that correlation does exist, the physical 
properties of these sequences remain to be studied. What 
kind of distribution do they have? All these questions point 
to future directions of research. 